Representativeness in Software Engineering Research

نویسندگان

  • Meiyappan Nagappan
  • Thomas Zimmermann
  • Christian Bird
چکیده

One of the goals of software engineering research is to achieve generality: Are the phenomena found in a few projects reflective of what goes on in others? Will a technique benefit more than just the projects it is evaluated on? The discipline of our community has gained rigor over the past twenty years and is now attempting to achieve generality through evaluation and study of an increasing number of software projects (sometime hundreds!). However, quantity is not the only important component. Selecting projects that are representative of a larger body of software of interest is just as critical. Little attention has been paid to selecting projects in such a way that generality and representativeness is maximized or even quantitatively characterized and reported. In this paper, we present a general technique for quantifying how representative a sample of software projects is of a population across many dimensions. We also present a greedy algorithm for choosing a maximally representative sample. We demonstrate our technique on research presented over the past two years at ICSE and FSE with respect to a population of 20,000 active open source projects. Finally, we propose methods of reporting objective measures of representativeness in research.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Advancing Trace Recovery Evaluation - Applied Information Retrieval in a Software Engineering Context

Successful development of software systems involves the efficient navigation of software artifacts. However, as artifacts are continuously produced and modified, engineers are typically plagued by challenging information landscapes. One stateof-practice approach to structure information is to establish trace links between artifacts; a practice that is also enforced by several development standa...

متن کامل

How healthy are software engineering conferences?

In this article we study the health of software engineering conferences by means of a suite of metrics created for this purpose. The metrics measure stability of the community, openness to new authors, introversion, representativeness of the PC with respect to the authors’ community, availability of PC candidates, and scientific prestige. Using this metrics suite, we assess the health of 11 sof...

متن کامل

Software Reliability Testing Covering Subsystem Interactions

This article proposes a novel approach to quantitative software reliability assessment ensuring high interplay coverage for software components and decentralized (sub-)systems. The generation of adequate test cases is based on the measurement of their operational representativeness, stochastic independence and interaction coverage. The underlying multi-objective optimization problem is solved b...

متن کامل

Software Process Improvement and Human Judgement Heuristics

This paper exemplifies how better knowledge about human judgement strategies known as heuristics can be used to improve software processes, especially estimation and prediction processes. Human judgement heuristics work well when they exploit a fit between their structure and the structure of the environment in which they are used. This use of environmental fit may lead to amazingly good judgem...

متن کامل

ارزشیابی کامل بودن و گویایی جغرافیایی نظام مراقبت بیماری هپاتیت B طی سال‌های 92-1385 در استان همدان: مرور تحلیلی مطالعات موجود

Background & Objectives: This study was conducted to evaluate the completeness and geographical representativeness of the hepatitis B surveillance system between 2007 and 2013 in Hamadan Province. Methods: In this descriptive study, all reported cases in Hamadan Province were extracted. The analytical literature review method was used to examine completeness. We used studies on the prevalenc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012